WHAT IS CLAIMED IS:



1. A method of identifying a zinc finger domain that recognizes a target site on a
DNA, the method comprising:

(a) providing cells containing a reporter construct, the construct
comprising a reporter gene operably linked to a promoter, wherein the reporter gene is
expressed above a given level when a transcription factor recognizes both a recruitment site
and a target site of the promoter, but not when the transcription factor recognizes only the
recruitment site of the promoter;

(b) providing a plurality of hybrid nucleic acids, each of which encodes a
non-naturally occurring protein comprising (i) a transcription activation domain, (ii) a DNA
binding domain that recognizes the recruitment site, and (iii) a test zinc finger domain,
wherein the encoded amino acid sequence of the test zinc finger domain varies among the
members of the plurality;

(c) contacting the plurality of hybrid nucleic acids with the cells under
conditions that permit at least one of the plurality of nucleic acids to enter at least one of the
cells;

(d) maintaining the cells under conditions permitting expression of the
hybrid nucleic acids in the cells; and

(e) identifying a cell that contains a hybrid nucleic acid of (b) and that
expresses the reporter gene above the given level as an indication that the cell contains a
hybrid nucleic acid encoding a test zinc finger domain that recognizes the target site.
2. The method of claim 1, wherein the cells are eukaryotic cells.



3. The method of claim 2, wherein the cells are yeast cells. 



4. The method of claim 3, wherein the cells are Saccharomyces cerevisiae cells. 



5. The method of claim 1, wherein the reporter gene is a selectable marker. 
6. The method of claim 5, wherein the selectable marker is selected from the group
consisting of URA3, HIS3, LEU2, ADE2, and TRP1.

7. The method of claim 1, wherein the reporter gene is selected from the group 
consisting of lacZ, CAT, luciferase, GUS, and GFP. 

8. The method of claim 1, wherein the DNA binding domain comprises a zinc finger 
domain. 



9. The method of claim 8, wherein the DNA binding domain comprises two zinc 
finger domains. 

10. The method of claim 9, wherein the DNA binding domain comprises three zinc 
finger domains. 

11. The method of claim 1, further comprising the steps of (i) amplifying a source
nucleic acid encoding the test zinc finger domain from genomic nucleic acid, a messenger 
RNA (mRNA) mixture, or a complementary DNA (cDNA) mixture, using an oligonucleotide 
primer that anneals to a sequence encoding a conserved domain boundary to produce an 
amplified fragment; and (ii) utilizing the amplified fragment to construct a hybrid nucleic 
acid for inclusion in the plurality of hybrid nucleic acids of step (b). 

12. The method of claim 1, further comprising the steps of (i) identifying a candidate 
zinc finger domain amino acid sequence in a sequence database; (ii) providing a candidate 
nucleic acid encoding the candidate zinc finger domain amino acid sequence, and (iii) 
utilizing the candidate nucleic acid to construct a hybrid nucleic acid for inclusion in the 
plurality of hybrid nucleic acids of step (b). 

13. The method of claim 5, wherein the selectable marker is an auxotrophy gene 
required for the synthesis of a metabolite; the genome of the cells lacks a functional copy of 
the auxotrophy gene; and, during step (d), the cells are maintained in a medium prepared 
without the metabolite. 
14. The method of claim 1, wherein steps (a) to (f) are repeated to identify a second
test zinc finger domain that recognizes a second target site. 

15. The method of claim 14, further comprising constructing a nucleic acid encoding 
a polypeptide comprising the first test zinc finger domain and the second test zinc finger 
domain. 



16. A method of identifying a zinc finger domain that recognizes a target site on a
DNA, the method comprising:

(a) providing cells containing a reporter construct, the construct comprising a
reporter gene operably linked to a promoter, wherein the reporter gene is expressed above a
given level when a transcription factor recognizes both a recruitment site and a target site of
the promoter, but not when the transcription factor recognizes only the recruitment site of the
promoter;

(b) amplifying a plurality of nucleic acid sequences, each of which encodes a test
zinc finger domain, using an oligonucleotide primer that anneals to a nucleic acid encoding a
conserved domain boundary;

(c) joining each nucleic acid sequence of (b) to nucleic acid sequences encoding
(i) a transcription activation domain, and (ii) a DNA binding domain that recognizes the
recruitment site, to form a plurality of hybrid nucleic acids;

(d) contacting the plurality of hybrid nucleic acids of (c) with the cells of (a)
under conditions that permit at least one of the plurality of hybrid nucleic acids to enter at
least one of the cells;

(e) maintaining the cells under conditions permitting expression of the hybrid
nucleic acids in the cells; and

(f) identifying a cell that contains a hybrid nucleic acid of (c) and that expresses
the reporter gene above the given level, wherein the hybrid nucleic acid encodes a zinc finger
domain that recognizes the target site on a DNA.

17. The method of claim 16, wherein the cells are yeast cells.
18. The method of claim 16, wherein the reporter gene is selected from the group
consisting of lacZ, CAT, luciferase, GUS, and GFP. 

19. The method of claim 16, wherein the DNA binding domain comprises a zinc 
finger domain. 

20. The method of claim 19, wherein the DNA binding domain comprises two zinc 
finger domains. 



21. A method of determining whether a test zinc finger domain recognizes a target
site on a promoter, the method comprising:

(a) providing a reporter construct comprising a reporter gene operably
linked to a promoter, wherein the reporter gene is expressed above a given level when a
transcription factor recognizes both a recruitment site and a target site of the promoter, but
not when the transcription factor recognizes only the recruitment site of the promoter;

(b) providing a hybrid nucleic acid that encodes a non-naturally occurring
protein comprising (i) a transcription activation domain, (ii) a DNA binding domain that
recognizes the recruitment site, and (iii) a test zinc finger domain;

(c) contacting the reporter construct with a cell under conditions that
permit the reporter construct to enter the cell;

(d) prior to, after, or concurrent with step (c), contacting the hybrid
nucleic acid with the cell under conditions that permit the hybrid nucleic acid to enter the
cell;

(e) maintaining the cell under conditions permitting expression of the
hybrid nucleic acid in the cell; and

(f) detecting reporter gene expression in the cell, wherein a level of
reporter gene expression greater than the given level is an indication that the test zinc finger
domain recognizes the target site.
22. The method of claim 21, further comprising the step of amplifying a nucleic acid
encoding the test zinc finger domain from genomic DNA, an mRNA mixture or a cDNA 
mixture using an oligonucleotide primer that anneals to a sequence encoding a conserved 
domain boundary. 

23. The method of claim 21, further comprising the steps of (i) identifying a 
candidate zinc finger domain amino acid sequence in a sequence database; (ii) providing a 
candidate nucleic acid encoding the candidate zinc finger domain amino acid sequence, and 
(iii) utilizing the candidate nucleic acid to construct a hybrid nucleic acid for inclusion in the 
plurality of hybrid nucleic acids of step (b). 



24. A method of determining whether a test zinc finger domain recognizes a target
site on a promoter, the method comprising:

(a) providing a first cell comprising a reporter construct comprising a
reporter gene operably linked to a promoter, wherein the reporter gene is expressed above a
given level when a transcription factor recognizes both a recruitment site and a target site of
the promoter, but not when the transcription factor recognizes only the recruitment site of the
promoter;

(b) providing a second cell comprising a hybrid nucleic acid that encodes
a protein comprising (i) a transcription activation domain, (ii) a DNA binding domain that
recognizes the recruitment binding site, and (iii) a test zinc finger domain;

(c) fusing the first and second cells to form a fused cell;

(d) maintaining the fused cell under conditions permitting expression of
the hybrid nucleic acids in the cell; and

(e) detecting reporter gene expression in the fused cell, wherein a level of
reporter gene expression greater than the given level is an indication that the test zinc finger
domain recognizes the target site.



25. The method of claim 24 wherein the first and second cells are yeast cells of the 
opposite mating types. 
26. A method of determining whether a test zinc finger domain recognizes a target
site on a promoter, the method comprising:

(a) providing a plurality of reporter constructs, each construct comprising
a reporter gene operably linked to a promoter, wherein the reporter gene is expressed above a
given level when a transcription factor recognizes both a recruitment site and a target site of
the promoter, but not when the transcription factor recognizes only the recruitment site of the
promoter;

(b) providing a cell containing a hybrid nucleic acid that encodes a non-
naturally occurring protein comprising (i) a transcription activation domain, (ii) a DNA
binding domain that recognizes the recruitment site, and (iii) a test zinc finger domain;

(c) contacting the plurality of reporter constructs with the cell under
conditions that permit at least one of the plurality of reporter constructs to enter the cell;

(d) maintaining the cell under conditions permitting expression of the
hybrid nucleic acid in the cell; and

(e) identifying a cell that contains a reporter gene of (a) and that expresses
the reporter gene above the given level as an indication that the reporter construct in the cell
comprises a target site recognized by the test zinc finger domain.

27. The method of claim 26, wherein the target binding site is between two and six 
nucleotides long. 

28. The method of claim 27, wherein the plurality of reporter constructs comprises 
every possible combination of A, T, G, and C nucleotides at at least two positions of the 
target binding site. 

29. The method of claim 28, wherein the plurality of reporter constructs comprises 
every possible combination of A, T, G, and C nucleotides at at least three positions of the 
target binding sites. 
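
As an illustration of the library size implied by claims 28 and 29, the following is a minimal sketch, not taken from the specification, that enumerates every combination of A, T, G, and C at the varied positions of a target binding site. The function name `enumerate_target_sites`, the wildcard letter `N`, and the example templates are hypothetical and chosen only for this illustration.

```python
from itertools import product

NUCLEOTIDES = "ATGC"


def enumerate_target_sites(template: str, wildcard: str = "N") -> list[str]:
    """Fill every wildcard position of `template` with A, T, G, or C.

    `template` is a hypothetical target-site template; fixed bases are kept
    as written and each wildcard position is varied independently.
    """
    positions = [i for i, base in enumerate(template) if base == wildcard]
    sites = []
    for combo in product(NUCLEOTIDES, repeat=len(positions)):
        site = list(template)
        for pos, base in zip(positions, combo):
            site[pos] = base
        sites.append("".join(site))
    return sites


# Two varied positions give 4**2 = 16 sites; three give 4**3 = 64.
two_varied = enumerate_target_sites("GNN")    # hypothetical template
three_varied = enumerate_target_sites("NNN")  # hypothetical template
print(len(two_varied), len(three_varied))
```

Under these assumptions, a plurality of reporter constructs covering three fully varied positions would need 64 distinct target sites, one per construct.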
30. The method of claim 26, wherein steps (a) to (e) are repeated for a second test
zinc finger domain to identify a second binding preference. 

31. The method of claim 30, further comprising constructing a nucleic acid encoding
a polypeptide comprising the first and second test zinc finger domains.

32. A method of identifying a plurality of zinc finger domains, the method 
comprising: 

carrying out the method of claim 1 to identify a first test zinc finger domain; and 

carrying out the method of claim 1 again to identify a second test zinc finger domain 
that recognizes a target site different from the target site recognized by the first test zinc 
finger domain. 

33. A method of generating a nucleic acid encoding a chimeric zinc finger protein, 
the method comprising: 

carrying out the method of claim 32; 

constructing a nucleic acid encoding a polypeptide comprising the first and 
second test zinc finger domains. 

34. A method of identifying DNA sequences recognized by zinc finger domains, the 
method comprising: 

carrying out the method of claim 24 to identify a first target site recognized by a first 
test zinc finger domain; and 

carrying out the method of claim 24 again to identify a second target site recognized 
by a second test zinc finger domain. 

35. A method of generating a nucleic acid encoding a chimeric zinc finger protein, 
the method comprising: 
carrying out the method of claim 34;

constructing a nucleic acid encoding a polypeptide comprising the first and 
second test zinc finger domains. 

36. A purified polypeptide comprising the amino acid sequence:
Xa-X-Cys-X2-5-Cys-X3-Xa-X-Cys-X-Ser-Asn-Xb-X-Arg-His-X3-5-His

(SEQ ID NO:68),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.

37. A nucleic acid comprising a sequence encoding the polypeptide of claim 36. 

38. A purified polypeptide comprising the amino acid sequence:
Xa-X-Cys-X2-5-Cys-X3-Xa-X-His-X-Ser-Asn-Xb-X-Lys-His-X3-5-His

(SEQ ID NO:69),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.

39. A nucleic acid comprising a sequence encoding the polypeptide of claim 38. 

40. A purified polypeptide comprising the amino acid sequence:
Xa-X-Cys-X2-5-Cys-X3-Xa-X-Ser-X-Ser-Asn-Xb-X-Arg-His-X3-5-His

(SEQ ID NO:70),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.

41. A nucleic acid comprising a sequence encoding the polypeptide of claim 40. 

42. A purified polypeptide comprising the amino acid sequence:
Xa-X-Cys-X2-5-Cys-X3-X^

(SEQ ID NO:71),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.

43. A nucleic acid comprising a sequence encoding the polypeptide of claim 42. 



44. A purified polypeptide comprising the amino acid sequence:
Xa-X-Cys-X2-5-Cys-X3-Xa-X-Val-X-Ser-Xc-Xb-X-Arg-His-X3-5-His (SEQ ID NO:72),
wherein Xa is phenylalanine or tyrosine, Xb is a hydrophobic residue, and Xc
is serine or threonine.

45. A nucleic acid comprising a sequence encoding the polypeptide of claim 44. 

46. A purified polypeptide comprising the amino acid sequence:

Xa-X-Cys-X2-5-Cys-X3-Xa-X^^
(SEQ ID NO:73),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.

47. A nucleic acid comprising a sequence encoding the polypeptide of claim 46. 

48. A purified polypeptide comprising the amino acid sequence:

Xa-X-Cys-X2-5-Cys-X3-Xa^^
(SEQ ID NO:74),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.

49. A nucleic acid comprising a sequence encoding the polypeptide of claim 48. 

50. A purified polypeptide comprising the amino acid sequence:

Xa-X-Cys-X2-5-Cys-X3-X^
(SEQ ID NO:75),

wherein Xa is phenylalanine or tyrosine, Xb is a hydrophobic residue, and Xc
is serine or threonine.



51. A nucleic acid comprising a sequence encoding the polypeptide of claim 50.

52. A purified polypeptide comprising an amino acid sequence 60% identical to SEQ 
ID NO:65. 



53. A nucleic acid comprising a sequence encoding the polypeptide of claim 52.
54. A purified polypeptide, comprising an amino acid sequence 60% identical to an
amino acid sequence selected from the group consisting of: SEQ ID NO:^9, 127, 129, 131, 
133, and 135. 

55. A nucleic acid, comprising a sequence encoding the polypeptide of claim 54.

56. A purified polypeptide comprising the amino acid sequence:
Xa-X-Cys-X2-5-Cys-X3^

(SEQ ID NO:150),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.



57. A nucleic acid comprising a sequence encoding the polypeptide of claim 56. 

58. A purified polypeptide comprising the amino acid sequence:
Xa-X-Cys-X2-5-Cys-X3-Xa-X-Q^

(SEQ ID NO:151),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.



59. A nucleic acid comprising a sequence encoding the polypeptide of claim 58.

60. A purified polypeptide comprising the amino acid sequence:
Xa-X-Cys-X2-5-Cys-X3-^X^

(SEQ ID NO:152),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.



61. A nucleic acid comprising a sequence encoding the polypeptide of claim 60.

62. A purified polypeptide comprising the amino acid sequence:
Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-His-Xb-X-Val-His-X3-5-His

(SEQ ID NO:153),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.
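
As an illustration of how a consensus sequence of this form can be read, the following is a minimal sketch, not part of the claims, that translates the motif of SEQ ID NO:153 into a regular expression over one-letter amino acid codes. The claims do not enumerate the hydrophobic residues, so the set `HYDROPHOBIC` below is an assumption, and the names `MOTIF_153`, `find_motif`, and the peptide `EXAMPLE_PEPTIDE` are hypothetical, invented only for this illustration.

```python
import re

# Assumption: the claims do not list the hydrophobic residues; this set is illustrative.
HYDROPHOBIC = "AVLIMFWY"

# Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-His-Xb-X-Val-His-X3-5-His
# Xa = Phe or Tyr -> [FY]; X = any residue -> "."; X2-5 -> .{2,5}; X3 -> .{3}
MOTIF_153 = re.compile(
    r"[FY].C.{2,5}C.{3}[FY].Q.SH[" + HYDROPHOBIC + r"].VH.{3,5}H"
)

# Hypothetical peptide constructed to fit the pattern; not a sequence from the source.
EXAMPLE_PEPTIDE = "YKCEECGKAFSQKSHLIVHIRTH"


def find_motif(protein: str) -> list[tuple[int, str]]:
    """Return (start index, matched text) for each motif occurrence in `protein`."""
    return [(m.start(), m.group()) for m in MOTIF_153.finditer(protein)]


print(find_motif("MA" + EXAMPLE_PEPTIDE + "GG"))
```

A sequence in which the glutamine is replaced by another residue no longer fits the pattern, which is how the consensus distinguishes this domain from the related ones recited in the neighboring claims.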
63. A nucleic acid comprising a sequence encoding the polypeptide of claim 62.

64. A purified polypeptide comprising the amino acid sequence:
Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-Asn-Xb-X-Ile-His-X3-5-His

(SEQ ID NO:154),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.

65. A nucleic acid comprising a sequence encoding the polypeptide of claim 64.

66. A purified polypeptide comprising the amino acid sequence:

Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln^^
(SEQ ID NO:155),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.



67. A nucleic acid comprising a sequence encoding the polypeptide of claim 66. 



68. A purified polypeptide comprising the amino acid sequence:
Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Thr-His-Xb-X-Gln-His-X3-5-His

(SEQ ID NO:156),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.

69. A nucleic acid comprising a sequence encoding the polypeptide of claim 68. 

70. A purified polypeptide comprising the amino acid sequence:

Cys-X2-5-Cys-X3-Xa-X-G^
(SEQ ID NO:157),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.

71. A nucleic acid comprising a sequence encoding the polypeptide of claim 70.
72. A purified polypeptide comprising the amino acid sequence:
Xa-X-Cys-X2-5-Cys-X3-Xa-X-Arg-X-Asp-Lys-Xb-X-Ile-His-X3-5-His

(SEQ ID NO:158),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.

73. A nucleic acid comprising a sequence encoding the polypeptide of claim 72.

74. A purified polypeptide comprising the amino acid sequence:
Xa-X-Cys-X2-5-Cys-X3-Xa-X-Arg-X-Ser-Asn-Xb-X-Arg-His-X3-5-His

(SEQ ID NO:159),

wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.



75. A nucleic acid comprising a sequence encoding the polypeptide of claim 74.

76. A purified polypeptide comprising an amino acid sequence 60% identical to SEQ
ID NO:141. 

77. A nucleic acid comprising a sequence encoding the polypeptide of claim 76. 

78. A purified polypeptide comprising an amino acid sequence 60% identical to SEQ 
ID NO:107.

79. A nucleic acid comprising a sequence encoding the polypeptide of claim 78.



80. A purified polypeptide comprising an amino acid sequence 60% identical to SEQ
ID NO:137.



81. A nucleic acid comprising a sequence encoding the polypeptide of claim 80.

82. A purified polypeptide comprising an amino acid sequence 60% identical to SEQ
ID NO:145.



83. A nucleic acid comprising a sequence encoding the polypeptide of claim 82. 
84. A purified polypeptide comprising an amino acid sequence 60% identical to SEQ
ID NO:149.

85. A nucleic acid comprising a sequence encoding the polypeptide of claim 84.